Overview

Dataset Statistics

Number of Variables 16
Number of Rows 1048575
Missing Cells 1
Missing Cells (%) 0.0%
Duplicate Rows 50476
Duplicate Rows (%) 4.8%
Total Size in Memory 128.0 MB
Average Row Size in Memory 128.0 B
Variable Types
  • Numerical: 7
  • Categorical: 9
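
The overview percentages follow from simple counts. The sketch below is a minimal, assumption-laden illustration, not the profiler's own code: the function name `overview` and the toy table are hypothetical, a "duplicate row" is assumed to mean any row that exactly repeats an earlier one, and the row count of 1048575 is taken from the Decimal Number totals reported for the single-character categorical columns.

```python
from collections import Counter


def overview(rows):
    """Summarize a non-empty table given as a list of equal-length tuples.

    None marks a missing cell. Assumption: a duplicate row is any row
    that exactly repeats an earlier one.
    """
    n_rows, n_cols = len(rows), len(rows[0])
    missing = sum(cell is None for row in rows for cell in row)
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "n_rows": n_rows,
        "n_cols": n_cols,
        "missing_cells": missing,
        "missing_pct": 100.0 * missing / (n_rows * n_cols),
        "duplicate_rows": duplicates,
        "duplicate_pct": 100.0 * duplicates / n_rows,
    }


# Toy table, not the profiled dataset.
toy = [(1, 0), (1, 0), (2, 1), (None, 1)]
summary = overview(toy)

# Reported figures: 50476 duplicates out of an assumed 1048575 rows.
reported_duplicate_pct = 100.0 * 50476 / 1048575
```

Under these assumptions the reported duplicate share rounds to 4.81%, which agrees with the Duplicates insight.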

Dataset Insights

TP3 and Reservoirs have similar distributions [Similar Distribution]
TP2 is skewed [Skewed]
H1 is skewed [Skewed]
DV_pressure is skewed [Skewed]
Motor_current is skewed [Skewed]
Dataset has 50476 (4.81%) duplicate rows [Duplicates]
COMP has constant length 1 [Constant Length]
DV_eletric has constant length 1 [Constant Length]
Towers has constant length 1 [Constant Length]
MPG has constant length 1 [Constant Length]
LPS has constant length 1 [Constant Length]
Pressure_switch has constant length 1 [Constant Length]
Oil_level has constant length 1 [Constant Length]
Caudal_impulses has constant length 1 [Constant Length]
y has constant length 1 [Constant Length]
TP2 has 871379 (83.1%) negatives [Negatives]
H1 has 174579 (16.65%) negatives [Negatives]
DV_pressure has 977102 (93.18%) negatives [Negatives]

Variables


TP2

numerical

Approximate Distinct Count 5099
Approximate Unique (%) 0.5%
Missing 1
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777184
Mean 1.4368
Minimum -0.032
Maximum 10.676
Zeros 83
Zeros (%) 0.0%
Negatives 871379
Negatives (%) 83.1%
  • TP2 is skewed right (γ1 = 1.8827)

Quantile Statistics

Minimum -0.032
5-th Percentile -0.018
Q1 -0.014
Median -0.012
Q3 -0.01
95-th Percentile 9.494
Maximum 10.676
Range 10.708
IQR 0.004
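
Range and IQR are derived from the other quantile rows: Range = Maximum - Minimum (10.676 - (-0.032) = 10.708) and IQR = Q3 - Q1 (-0.01 - (-0.014) = 0.004). The sketch below shows one way to compute the same summary with the Python standard library. The helper name `quantile_summary` is hypothetical, and the "inclusive" interpolation method is an assumption; the report does not say how it interpolates, and it may use approximate quantiles.

```python
from statistics import quantiles


def quantile_summary(values):
    """Return the quantile rows of the report for a list of numbers."""
    # Assumption: linear interpolation over the observed range ("inclusive").
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    twentieths = quantiles(values, n=20, method="inclusive")
    p5, p95 = twentieths[0], twentieths[-1]
    low, high = min(values), max(values)
    return {
        "min": low,
        "p5": p5,
        "q1": q1,
        "median": median,
        "q3": q3,
        "p95": p95,
        "max": high,
        "range": high - low,
        "iqr": q3 - q1,
    }


# Toy data, not the TP2 column.
summary = quantile_summary(list(range(101)))
```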

Descriptive Statistics

Mean 1.4368
Standard Deviation 3.2901
Variance 10.8249
Sum 1.5066e+06
Skewness 1.8827
Kurtosis 1.6581
Coefficient of Variation 2.2899
  • TP2 is not normally distributed (p-value 4.683382888616017e-25)
  • TP2 has 195538 outliers
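
The report does not state how it counts outliers. A common convention, assumed here purely for illustration, is Tukey's rule: flag values below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR. With the TP2 quartiles above (Q1 = -0.014, Q3 = -0.01) that rule would give fences of about -0.020 and -0.004. The function names are hypothetical.

```python
def tukey_fences(q1, q3, k=1.5):
    """Lower and upper outlier fences under Tukey's k * IQR rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def count_outliers(values, low, high):
    """Count the values that fall strictly outside [low, high]."""
    return sum(1 for x in values if x < low or x > high)


# Fences from the quartiles reported for TP2.
low, high = tukey_fences(-0.014, -0.010)

# Toy values, not the TP2 column: two of the four fall outside the fences.
n_flagged = count_outliers([-0.012, -0.03, 9.5, -0.01], low, high)
```

Under this assumption, a column whose central half sits in a 0.004-wide band gets very tight fences, so every reading from a second, higher mode would be flagged.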

TP3

numerical

Approximate Distinct Count 3148
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777200
Mean 8.9635
Minimum 0.73
Maximum 10.302
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • TP3 is skewed left (γ1 = -0.7039)

Quantile Statistics

Minimum 0.73
5-th Percentile 8.126
Q1 8.468
Median 8.93
Q3 9.472
95-th Percentile 9.96
Maximum 10.302
Range 9.572
IQR 1.004

Descriptive Statistics

Mean 8.9635
Standard Deviation 0.6325
Variance 0.4
Sum 9.3989e+06
Skewness -0.7039
Kurtosis 6.1771
Coefficient of Variation 0.07056
  • TP3 is not normally distributed (p-value 0.00046825109353635774)
  • TP3 has 2329 outliers
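
Several descriptive rows are functions of one another: Variance = (Standard Deviation)² and Coefficient of Variation = Standard Deviation / Mean. The sketch below checks the TP3 figures against those identities. The helper name `check_descriptive` and the 0.1% tolerance are assumptions chosen to absorb the rounding in the printed values.

```python
import math


def check_descriptive(mean, std, variance, cv, rel_tol=1e-3):
    """Check reported values against variance = std**2 and cv = std / mean."""
    return {
        "variance": math.isclose(std ** 2, variance, rel_tol=rel_tol),
        "cv": math.isclose(std / mean, cv, rel_tol=rel_tol),
    }


# Values as printed for TP3.
tp3 = check_descriptive(mean=8.9635, std=0.6325, variance=0.4, cv=0.07056)
```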

H1

numerical

Approximate Distinct Count 2273
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777200
Mean 7.4739
Minimum -0.036
Maximum 10.288
Zeros 105
Zeros (%) 0.0%
Negatives 174579
Negatives (%) 16.7%
  • H1 is skewed left (γ1 = -1.6808)

Quantile Statistics

Minimum -0.036
5-th Percentile -0.014
Q1 8.228
Median 8.758
Q3 9.356
95-th Percentile 9.896
Maximum 10.288
Range 10.324
IQR 1.128

Descriptive Statistics

Mean 7.4739
Standard Deviation 3.403
Variance 11.5801
Sum 7.8369e+06
Skewness -1.6808
Kurtosis 0.9897
Coefficient of Variation 0.4553
  • H1 is not normally distributed (p-value 1.4317055822659437e-06)
  • H1 has 176498 outliers

DV_pressure

numerical

Approximate Distinct Count 1950
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777200
Mean 0.08626
Minimum -0.032
Maximum 9.844
Zeros 371
Zeros (%) 0.0%
Negatives 977102
Negatives (%) 93.2%
  • DV_pressure is skewed right (γ1 = 4.4769)

Quantile Statistics

Minimum -0.032
5-th Percentile -0.026
Q1 -0.024
Median -0.022
Q3 -0.018
95-th Percentile 0.778
Maximum 9.844
Range 9.876
IQR 0.006

Descriptive Statistics

Mean 0.08626
Standard Deviation 0.4438
Variance 0.1969
Sum 90450.358
Skewness 4.4769
Kurtosis 21.8735
Coefficient of Variation 5.1445
  • DV_pressure is not normally distributed (p-value 4.373193715267938e-25)
  • DV_pressure has 72420 outliers

Reservoirs

numerical

Approximate Distinct Count 3181
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777200
Mean 8.964
Minimum 0.712
Maximum 10.3
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Reservoirs is skewed left (γ1 = -0.7114)

Quantile Statistics

Minimum 0.712
5-th Percentile 8.128
Q1 8.47
Median 8.93
Q3 9.472
95-th Percentile 9.958
Maximum 10.3
Range 9.588
IQR 1.002

Descriptive Statistics

Mean 8.964
Standard Deviation 0.6317
Variance 0.3991
Sum 9.3995e+06
Skewness -0.7114
Kurtosis 6.2518
Coefficient of Variation 0.07047
  • Reservoirs is not normally distributed (p-value 0.00044289508285488946)
  • Reservoirs has 2333 outliers

Oil_temperature

numerical

Approximate Distinct Count 2210
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777200
Mean 61.1269
Minimum 15.4
Maximum 83.125
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Oil_temperature is skewed right (γ1 = 0.2652)

Quantile Statistics

Minimum 15.4
5-th Percentile 51.425
Q1 56.2
Median 60.3
Q3 65.6
95-th Percentile 73.725
Maximum 83.125
Range 67.725
IQR 9.4

Descriptive Statistics

Mean 61.1269
Standard Deviation 6.69
Variance 44.7566
Sum 6.4096e+07
Skewness 0.2652
Kurtosis -0.04396
Coefficient of Variation 0.1094
  • Oil_temperature is not normally distributed (p-value 0.00387927421347347)
  • Oil_temperature has 4271 outliers

Motor_current

numerical

Approximate Distinct Count 1653
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16777200
Mean 1.9805
Minimum 0.02
Maximum 9.295
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Motor_current is skewed right (γ1 = 0.522)

Quantile Statistics

Minimum 0.02
5-th Percentile 0.0375
Q1 0.04
Median 0.0425
Q3 3.825
95-th Percentile 5.96
Maximum 9.295
Range 9.275
IQR 3.785

Descriptive Statistics

Mean 1.9805
Standard Deviation 2.3235
Variance 5.3987
Sum 2.0767e+06
Skewness 0.522
Kurtosis -1.4189
Coefficient of Variation 1.1732
  • Motor_current is not normally distributed (p-value 8.112661229556352e-23)
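
The negative Kurtosis values in this report (for example -1.4189 for Motor_current) indicate excess kurtosis, since raw kurtosis can never be less than 1. The sketch below computes skewness γ1 and excess kurtosis from central moments. It assumes the population (biased) estimators; the report does not say whether it applies a bias correction, and the function name `shape_moments` is hypothetical.

```python
def shape_moments(values):
    """Return (skewness, excess kurtosis) from population central moments."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    skewness = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3.0
    return skewness, excess_kurtosis


# Toy two-level signal, not the Motor_current column.
skewness, excess_kurtosis = shape_moments([0, 0, 1, 1])
```

A symmetric two-level signal like the toy input has zero skewness and the lowest possible excess kurtosis, -2, so strongly negative kurtosis is one hint that a column is bimodal.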

COMP

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (1) is over 4.77 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (1, 0) take over 50.0%
  • COMP has words of constant length

DV_eletric

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (0) is over 4.91 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (0, 1) take over 50.0%
  • DV_eletric has words of constant length

Towers

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (1) is over 10.88 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (1, 0) take over 50.0%
  • Towers has words of constant length

MPG

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (1) is over 4.61 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (1, 0) take over 50.0%
  • MPG has words of constant length

LPS

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (0) is over 407.8 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (0, 1) take over 50.0%
  • LPS has words of constant length

Pressure_switch

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (1) is over 91.93 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (1, 0) take over 50.0%
  • Pressure_switch has words of constant length

Oil_level

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (1) is over 68.24 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (1, 0) take over 50.0%
  • Oil_level has words of constant length

Caudal_impulses

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (1) is over 10.27 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (1, 0) take over 50.0%
  • Caudal_impulses has words of constant length

y

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69205950
  • The largest value (0) is over 27.42 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1048575
  • The top 2 categories (0, 1) take over 50.0%
  • y has words of constant length
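
For a two-category column, the "times larger" ratio r converts directly into class shares: the larger category holds r / (1 + r) of the rows and the smaller holds 1 / (1 + r). The sketch below applies this to the ratio reported for y. The function name `shares_from_ratio` is hypothetical, and because the report says "over 27.42", the result is approximate.

```python
def shares_from_ratio(ratio):
    """Convert a majority:minority count ratio into (majority, minority) shares."""
    majority = ratio / (1.0 + ratio)
    return majority, 1.0 - majority


# Ratio reported for y: value 0 is about 27.42 times as frequent as value 1.
majority_share, minority_share = shares_from_ratio(27.42)
```

Under this reading, value 1 accounts for roughly 3.5% of the rows in y, which matters if y is used as a classification target.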

Interactions

Correlations

Missing Values